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1. A method for network packet processing comprises: receiving network packets; and 
operating on the network packets with a plurality of program threads to affect 
processing of the packets; wherein the plurality of progra m threads are scheduler 
program threads to schedule task orders for processing and processing program 
threads that process packets in accordance with task assignments assigned by the 
scheduler program threads, the scheduler program threads can schedule any one of a 
plurality of processing program threads to handle processing of a task, the 
scheduler program thread writes a register with an address corresponding to a 
location of data for the plurality of processing program threads, a selected one of 
the plurality of processing program threads that can handle the task reads the 
register to obtain the location of the data, the selected one of the plurality of 
processing progra m threads reads the register to obtain the location of the data 
and to assign itself to processing the task requested by the scheduler program 
thread. 

10. A method for network packet processing comprises: receiving network packets; 
and operating on the network packets with a plurality of program threads to affect 
processing of the packets; wherein the plurality of program threads are scheduler 
program threads to schedule task orders for processing and processing program 
threads that process packets in accordance with task assignments assigned by the 
scheduler program threads, the scheduler progra m threads can schedule any one of a 
plurality of processing program threads to handle processing of a task, the 
scheduler program thread writes a register with an address corresponding to a 
location of data for the plurality of processing program threads, a selected one of 
the plurality of processing program threads that can handle the task reads the 
register to obtain the location of the data, the selected one of the plurality of 
processing tasks reads the register to obtain the location of the data, while the 
register is cleared by reading the register by the program thread to assign itself 
to process the task. 
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(57) 



ABSTRACT 



A parallel hardware-based multithreaded processor is 
described. The processor includes a general purpose proces- 
sor that coordinates system functions and a plurality of 
microengines that support multiple program threads. The 
processor also includes a memory control system that has a 
first memory controller that sorts memory references based 
on whether the memory references are directed than even 
bank or an odd bank of memory and a second memory 
controller that optimizes memory references based upon 
whether the memory references are read references or write 
references. A program thread communication scheme for 
packet processing is also described. 
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In state 718, a thread or other process on the selected processor begins processing 
the packet that was stored in the processor 1 s queue. Methods of processing a packet 
through its protocol stack are well known to those skilled in the art and need not 
be described in detail. The illustrated procedure then ends with end state 720. 
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LLP 

(57) ABSTRACT 

A system and method are provided for identifying related 
packets in a communication flow for the purpose of collec- 
tively processing them through a protocol stack comprising 
one or more protocols under which the packets were trans- 
mitted. A packet received at a network interface is parsed to 
retrieve information from one or more protocol headers. A 
flow key is generated to identify a communication flow that 
includes the packet, and is stored in a database of flow keys. 
When the packet is placed in a queue to be transferred to a 
host computer, the flow key and/or its flow number (e.g., its 
index into the database) is stored in a separate queue. Near 
to the time at which the packet is transferred to the host 
computer, a dynamic packet batching module searches for a 
packet that is related to the packet being transferred (i.e., is 
in the same flow) but which will be transferred later in time. 
If a related packet is located, the host computer is alerted 
and, as a result, delays processing the transferred packet 
until the related packet is also received. By collectively 
processing the related packets, processor time is more effi- 
ciently utilized. 

27 Claims, 49 Drawing Sheets 
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